take:
https://arxiv.org/abs/2005.04536 Accelerating Deep Neuroevolution on Distributed FPGAs for Reinforcement Learning Problems
https://ieeexplore.ieee.org/document/477409 Design and implementation of a multicomputer interconnection network using FPGAs
https://ieeexplore.ieee.org/document/8533511 Application Partitioning on FPGA Clusters: Inference over Decision Tree Ensembles
https://ieeexplore.ieee.org/document/9116220 BNNsplit: Binarized Neural Networks for embedded distributed FPGA-based computing systems
https://ieeexplore.ieee.org/document/1106696 The design of the Amalgam reconfigurable cluster
https://ieeexplore.ieee.org/document/7529857 Clustering and Mapping Algorithm for Application Distribution on a Scalable FPGA Cluster
https://ieeexplore.ieee.org/document/7285090 FPGA-based accelerator platform for big data matrix processing
https://ieeexplore.ieee.org/document/7991036 Hadoop cluster with FPGA-based hardware accelerators for K-means clustering algorithm
https://ieeexplore.ieee.org/document/9042006 Hadoop ZedBoard cluster with GZIP compression FPGA acceleration
https://ieeexplore.ieee.org/document/7152609 Accelerating Machine Learning Kernel in Hadoop Using FPGAs
https://ieeexplore.ieee.org/document/7395718 FPGA-Accelerated Hadoop Cluster for Deep Learning Computations
https://ieeexplore.ieee.org/document/9029124 The Library for Hadoop Deflate Compression Based on FPGA Accelerator with Load Balance
https://ieeexplore.ieee.org/document/6927472 Interconnect for commodity FPGA clusters: Standardized or customized?
https://ieeexplore.ieee.org/document/6687430 Integration of a Highly Scalable, Multi-FPGA-Based Hardware Accelerator in Common Cluster Infrastructures
https://ieeexplore.ieee.org/document/6750072 The SpiNNaker Project
https://ieeexplore.ieee.org/document/1620784 OLD! RAMP: research accelerator for multiple processors - a community vision for a shared experimental parallel HW/SW platform
https://ieeexplore.ieee.org/document/5981829 OLD! NeuFlow: A runtime reconfigurable dataflow processor for vision
https://ieeexplore.ieee.org/document/803684 OLD! Implementing an API for distributed adaptive computing systems
https://ieeexplore.ieee.org/document/707919 OLD! SLAAC: a distributed architecture for adaptive computing

auxiliary papers:
https://ieeexplore.ieee.org/document/9188138 Exploration of Clustering Algorithms effects on Mesh of Clusters based FPGA Architecture Performance
https://ieeexplore.ieee.org/document/6239804 Bluehive - A field-programable custom computing machine for extreme-scale real-time neural network simulation

don't take:
A survey of FPGA-based accelerators for convolutional neural networks
Distributed FPGA-based architecture to support indoor localisation and orientation services
A Distributed Canny Edge Detector: Algorithm and FPGA Implementation
An FPGA-Based On-Device Reinforcement Learning Approach using Online Sequential Learning
FA3C: FPGA-Accelerated Deep Reinforcement Learning
Distributed Deep Learning With GPU-FPGA Heterogeneous Computing
